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Abstract 

This paper develops a game-theoretic framework for the design and analysis of a new class of 
incentive schemes called intervention schemes. We formulate intervention games, propose a solution 
concept of intervention equilibrium, and prove its existence in a finite intervention game. We apply our 
framework to resource sharing scenarios in wireless communications, whose non-cooperative outcomes 
without intervention yield suboptimal performance. We derive analytical results and analyze illustrative 
examples in the cases of imperfect and perfect monitoring. In the case of imperfect monitoring, in- 
tervention schemes can improve the suboptimal performance of non-cooperative equihbrium when the 
intervention device has a sufficiently accurate monitoring technology, although it may not be possible to 
achieve the best feasible performance. In the case of perfect monitoring, the best feasible performance 
can be obtained with an intervention scheme when the intervention device has a sufficiently strong 
intervention capability. 

Index Terms 

Game theory, incentives, intervention, resource sharing, wireless communications. 

I. Introduction 

When self-interested users share resources non-cooperatively, it is common that the resources 
are utilized suboptimally from a global point of view [2J. Hence, overcoming the suboptimal 
performance of non-cooperative outcomes poses an important challenge for successful resource 
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Utilization. The aforementioned phenomenon is widely observed in wireless communications, 
where users compete for radio resources interfering with each other. For the sake of discussion, 
consider the following abstract scenario of resource sharing in communications. First, users 
determine their resource usage levels, which in turn determine the service quality they receive. 
In general, as the overall usage level increases, the service quality is reduced due to interference 
or congestion. The payoff of a user is determined by its own usage level as well as the service 
quality. In such a scenario, users tend to choose a higher usage level than the socially optimal 
one. That is, it is in the self-interest of users to choose a high usage level, although reducing their 
usage levels simultaneously would benefit all of them. In game theory, such a conflict between 
private and social interests is modeled as the prisoner's dilemma game. In the literature, it 
has been shown that various wireless communication scenarios exhibit a prisoner's dilemma 
phenomenon, including packet forwarding lO, distributed spectrum allocation [4], and medium 
access control (CSMA/CA jH and slotted Aloha JSl). 

Incentive schemes are needed to improve the performance of non-cooperative outcomes. In this 
paper, we propose a class of incentive schemes based on the idea of intervention. Implementing 
an intervention scheme requires an intervention device that is able to monitor the actions of users 
and to affect their resource usage. An intervention manager first chooses an intervention rule 
used by the intervention device, and then users choose their actions knowing the intervention 
rule chosen by the manager. After observing a signal about the actions of users, the intervention 
device chooses its action according to the intervention rule. The manager chooses an intervention 
rule to maximize his payoff, anticipating the rational behavior of users given the intervention rule. 
The payoff of the manager can be considered as a measure of the system performance, which 
can incorporate various efficiency and fairness criteria. We formulate the interaction between 
users and a manager as an intervention game and propose a solution concept called intervention 
equilibrium. Intervention equilibrium predicts the outcome of an intervention game in terms of 
an intervention rule chosen by the manager and an operating point chosen by users. 

Intervention can be classified into two types, called type 1 and type 2, depending on how the 
intervention device acts in the system relative to users. In type-1 intervention, the intervention 
device acts in a symmetric way as users do while having the ability to monitor the actions of 
other users. An example of type-1 intervention can be found in ff] and jHl, which consider a 
random access network where an intervention device interferes with other users by transmitting 
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its packets after obtaining information about the transmission probabilities of users. In type-2 
intervention, the intervention device acts as a gatekeeper which can control resource usage by 
users. An example of type-2 intervention can be found in |l9l and IfTOl . analyzes scheduling 
mechanisms where a scheduler assigns different priorities to traffic flows depending on their 
input rates, and [lOJ considers a packet dropping mechanism where the server determines the 
probability of dropping packets as a function of the total arrival rate. The two types of intervention 
can be applied to the aforementioned resource sharing scenario, as schematically shown in Fig. [B 
The goal of intervention schemes to improve the performance of non-cooperative outcomes is 
illustrated in Fig. [21 with two users and the system performance measured by the average payoff 
of the two users. Our analysis is aimed at answering the following two questions. 

1) When can we construct an intervention scheme that improves the suboptimal performance 
of non-cooperative equilibrium? 

2) When can we construct an intervention scheme that achieves the best feasible performance? 
Our analysis suggests that the answers to these questions depend on the ability of the intervention 
device: 

• Ability to monitor the actions of users (i.e., monitoring technology), 

• Ability to affect the payoffs of users through its actions (i.e., intervention capability). 
The discussion on the example in Section IIII-BI shows that an intervention scheme can improve 
the performance of non-cooperative equilibrium when the monitoring technology is sufficiently 
accurate. This result is reinforced by the analytical results and the example in Section |Wl 
which considers the case of perfect monitoring. The analytical result in Section IIII-AI shows 
that intervention schemes may not achieve the best feasible performance when the monitoring 
technology is noisy. On the other hand, the analytical results and the example in Section |IV] 
show that intervention schemes can achieve the best feasible performance when monitoring is 
perfect and the intervention device has a sufficiently strong intervention capability. When signals 
are noisy, the manager can provide incentives by triggering a punishment following signals that 
are more likely to occur when users deviate. When these signals occur with positive probability 
even when users do not deviate, punishment happens from time to time at equilibrium, which 
results in a performance loss. On the contrary, when signals are perfectly accurate, punishment 
through intervention can be used only as a threat, which is never used at equilibrium. Thus, in 
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the case of perfect monitoring, it is possible for the manager to achieve a desired operating point 
without incurring a performance loss. 

The rest of this paper is organized as follows. In Section [III we formulate intervention 
games, develop a solution concept of intervention equilibrium, and show its existence in a finite 
intervention game. In Sections Un] and |Wl we derive analytical results and discuss illustrative 
examples in the cases of imperfect and perfect monitoring, respectively. In Section|Vl we compare 
intervention schemes with existing approaches in the literature. In Section |Vll we conclude. 

II. Intervention Games and Intervention Equilibrium 

We consider a system (e.g., a wireless network) where N users and an intervention device 
interact. The set of the users is finite and denoted by A/" = {1, . . . , N}. The action space of user 
i is denoted by Ai, and a pure action for user i is denoted by Oj G Ai, for all i E M . A pure 
action profile is represented by a vector a = (ai, . . . , otv), and the set of pure action profiles is 
denoted hy A = HiGA^^*- ^ rnixed action for user i is a probability distribution over Ai and is 
denoted by E A(Ai), where A(X) is the set of all probability distributions over a set X. A 
mixed action profile is represented by a vector a = (ai, . . . , on) G nieA/'^(^«)- ^ mixed action 
profile of the users other than user i is written as a-i = (ai, . . . , ttj+i, . . . , oat) so that a 
can be expressed as a = (oj, a_i). Once a pure action profile of the users is determined, a signal 
is realized from the set of all possible signals, denoted Y, and is observed by the intervention 
device. We represent the probability distribution of signals by a mapping p : A ^0^)- That is, 
p(a) E A(F) denotes the probability distribution of signals given a pure action profile a. When 
Y is finite, the probability that a signal y is realized given a pure action profile a is denoted 
by p{y\a). After observing the realized signal, the intervention device takes its action, called an 
intervention action. We use oq, ao, and Aq to denote a pure action, a mixed action, and the set 
of pure actions for the intervention device, respectively. 

Since the intervention device chooses its action after observing the signal, a strategy for it 
can be represented by a mapping f : Y A(Aq), which is called an intervention rule. That is, 
f{y) E A(y4o) denotes the mixed action for the intervention device when it observes a signal y. 
When Aq is finite, the probability that the intervention device takes an action given a signal y is 
denoted by /(aoll/)- The set of all possible intervention rules is denoted by F. There is a system 
manager who determines the intervention rule used by the intervention device. We assume that 
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the manager can commit to an intervention rule, for example, by using a protocol embedded in 
the intervention device. The payoffs of the users and the manager are determined by the actions 
of the intervention device and the users and the realized signal. We denote the payoff function 
of user i e Af hy Ui : Aq X A X Y ^ M and that of the manager hy uq : Aq x A x Y ^ M. 
We call the pair (Y, p) the monitoring technology of the intervention device, and call Aq its 
intervention capability. An intervention device is characterized by these two, and we represent 
an intervention scheme by ((F, p), Aq, /). 

The game played by the manager and the users is formulated as an intervention game, which 
is summarized by the data 

r = (A/'o, (Ai)jgAro, (Mi)ieA/-o, O^^P)) , 

where A^o = AT U{0}. The sequence of events in an intervention game can be listed as follows. 

1) The manager chooses an intervention rule / G J-". 

2) The users choose their actions a E J^.^^ A(y4j) simultaneously, knowing the intervention 
rule / chosen by the manager. 

3) A pure action profile a is realized following the probability distribution a, and a signal 
y gY is realized following the probability distribution p{a). 

4) The intervention device chooses its action oq G Aq following the probability distribution 

m- 

Ex ante payoffs, or expected payoffs given an intervention rule and a pure action profile, can 
be computed by taking expectations with respect to signals and intervention actions. The ex ante 
payoff function of user i is denoted by a function t>j : x A — t- R, while that of the manager is 
denoted hy Vq : J-' x A M.. We say that an intervention game is finite if Ai, for i E Ao, and 
Y are all finite. In a finite intervention game, ex ante payoffs can be computed as 

Vi{f,a) = Y^ Ui{ao,a,y)f{ao\y)p{y\a), 

yeY aoGAo 

for all i E Aq. Once the manager chooses an intervention rule /, the users play a simultaneous 
game, whose normal form representation is given by 

= (A/", {Ai)i^x,M.-))i€M) ■ 

We predict actions chosen by the users given an intervention rule / by applying the solution 
concept of Nash equilibrium to the induced game Tf. With an abuse of notation, we extend 
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the domain of Vi to T x A(Aj) for all i G A/q by taking expectation with respect to pure 

action profiles. 

Definition 1: An intervention rule f E J-' sustains an action profile a* E fljg^ A(y4j) if a* 
is a Nash equilibrium of the game Tj, i.e., 

Vi{f, a*, a*_i) > Vi{f, ai, alj for all ai E A{Ai), for all i E J\f . 

An action profile a* is sustainable if there exists an intervention rule / that sustains a* . 

Let £{f) C Y\.ieM ^(^0 be the set of action profiles sustained by /. We say that a pair (/, a) 
is attainable if a E £{f). The manager's problem is to find an attainable pair that maximizes 
his ex ante payoff among all attainable pairs, which leads to the following solution concept for 
intervention games. 

Definition 2: {f*,a*) E x Yl-^j^ A{Ai) is an intervention equilibrium if a* E £{f*) and 

fo(/*, a*) > fo(/, a) for all (/, a) such that a E £{f). 

/* G J-" is an optimal intervention rule if there exists an action profile a* E Y\_i<zj^ ^{^i) such 
that (/*,«*) is an intervention equilibrium. 

An intervention equilibrium solves the following optimization problem: 

maxwo(/, a) subject to a G £{f). (1) 

(/.") 

The constraint a E £{f) represents incentive constraints for the users, which require that the users 
choose the action profile a in their self-interest given the intervention rule /. The problem ([U) can 
be rewritten as maxjg7rmaxag£-(j) fo(/, a). Then an intervention equilibrium can be considered 
as a subgame perfect equilibrium (or Stackelberg equilibrium), with an implicit assumption that 
the manager can induce the users to choose the best Nash equilibrium for him in case of multiple 
Nash equilibria. Our interpretation is that, in order to achieve an intervention equilibrium (/*, a*), 
the manager announces the intervention rule /* and recommends the action profile a* to the 
users. Since a* E £{f*), the users do not have an incentive to deviate unilaterally from a*, and 
a* becomes a focal point [fTTI of the game Fp. Below we show the existence of an intervention 
equilibrium in a finite intervention game. 

Proposition 1: Every finite intervention game has an intervention equilibrium. 

We prove Proposition \T\ using the following two lemmas. 
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Lemma 1: The correspondence E : T ^ W^^j^ [S.{A^ is nonempty, compact-valued, and 
upper hemi-continuous. 

Proof: We can show that, for any / G J-", the set <£^(/) is nonempty by applying Nash 
Theorem [fT2l to F/. Since njGA^^(^«) bounded, it suffices to show that E has a closed 
graph to prove that E is compact- valued and upper hemi-continuous (u.h.c.) (see Theorem 3.4 of 
ifTBl ). Choose a sequence {(/", a")} with (/", a") — )■ (/, a) and a" G for all n. Suppose 

that a ^ Then there exists i E M such that is not a best response to a^i in V f. Then 

there exist e > and a\ such that f > f a^, + 3e. Since is continuous and 

(/", a") — )■ (/, a), for sufficiently large n we have 

which contradicts a"^ E E{f^). ■ 
Define a function t;o : — M by t'o(/) = maxQ,gf(/) fo(/, a). For each /, is nonempty 
and compact by Lemma [Hand vo{f, ■) is continuous. Hence, the function vq is well-defined. 
Lemma 2: The function vq is upper semi-continuous. 

Proof: Let E{f) = {a e E{f) : fo(/, a) = ^^o(/)}- Note that E{f) is nonempty for all 
/. Fix /, and let {/"} be any sequence converging to /. Choose G for all n. Let 

Vq = limsup„^oo vo{f'^). Then there exists a subsequence {Z"''} such that t>Q = limt>o(/"*, a"''). 
Since a" G E{f^) and £^ is u.h.c, there exists a convergent subsequence of {a"''}, called {a-'}, 
whose limit point a is in Hence, Vq = \imvo{f^ , a^) = vo{f,a) < vo{f) since a G E{f). 

■ 

Note that the space of intervention rules, J^, is equivalent to (A(Ao))'^', which is compact. 
Therefore, a solution to max/gjrt'o(/) exists, which establishes the existence of an intervention 
equilibrium. This completes the proof of Proposition [B 

There can be multiple intervention equilibria, all of which yield the same payoff for the 
manager. We can propose different selection criteria for the manager to choose an intervention 
equilibrium out of multiple ones. For example, the discussion on affine intervention rules in 
Section IIV-AI is motivated by the robustness of performance to mistakes by the users as well as 
simplicity. 

Recall that an intervention device is characterized by (Y, p) and Aq. In this paper, we focus on 
the problem of finding an optimal intervention rule when the manager has a particular intervention 
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device. However, we can think of a scenario where the manager can select an intervention 
device from multiple ones given the operating cost of each available intervention device. Our 
analysis in this paper allows the manager to evaluate the optimal performance achieved with 
each intervention device. He can then select the best intervention device taking into account 
both performance and cost. 

III. Performance with Intervention under Imperfect Monitoring 
A. Analytical Results 

In this section, we maintain the following assumption. 

Assumption 1: There exists an action for the intervention device So G Aq that satisfies 

Mo(ao, a, y) > no(ao, a, y) for all ao ^ So, 

for all a E A and y eY. 

Assumption \T\ asserts the existence of an intervention action that is most preferred by the 
manager regardless of the action profile of the users and the signal. We can interpret the most 
preferred intervention action, So, as the intervention action that corresponds to no intervention. 
Then Assumption [U states that exerting intervention is costly for the manager, reflecting that 
intervention typically degrades the overall performance. Moreover, there is some operational 
cost (e.g., energy consumption) needed to exert intervention. 

Define an intervention rule / by f(y) = So for all y. It can be considered that the man- 
ager decides not to intervene at all when he chooses /. Let vq = sup(j^) i;o(/, a), Vq = 
sup J sup^g£-(j) fo(/, a), and Vq = sup^g^^^^ Vo{f, a). Vq is the best performance that the manager 
can obtain when the users are not subject to the incentive constraints (e.g., when the actions 
of the users can be completely controlled by the manager). Vq is the best performance when 
the manager is required to satisfy the incentive constraints for the users. Lastly, vq is the best 
performance when the manager does not engage in active intervention. It is straightforward to see 
that Vq < Vq <vo. The following proposition provides a sufficient condition on the intervention 
game for a gap between vq and to exist. 

Proposition 2: Suppose that the intervention game is finite, p has full support (i.e., p{y\a) > 
for all y and a), and there is no a such that a E £{f) and t'o(/, «) = vq. Then < vq. 
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Proof: Suppose that the conclusion does not hold, i.e., = vq. Since the intervention 
game is finite, is attained by Proposition [U Thus, there exists (/, a) such that a E £{f) and 
Vo{f, a) = vq. Note that vq = vo{f, a) < vo{f, a) < vq. Hence, vo{f, a) = Voif, a). Since p has 
full support, we have f{y) = f{y) for all y. This contradicts the hypothesis that there is no a 
such that a E £{f) and fo(/, a) = vq. ■ 
When the intervention game is finite, vq is attained since vq is continuous and (J^xJ^.^^ A(Aj)) 
is compact. Since fo(/, a) > fo(/, a) for all a, for all /, we have vq = maxQ,t>o(/, a). In fact, 
when the intervention game is finite and p has full support, / is the only intervention rule that 
can attain the best feasible performance, vq. When / sustains no action profile that attains vq, the 
manager needs to trigger a punishment following some signals in order to provide appropriate 
incentives for the users to follow an action profile such that VQ(f,a) = Vq. However, since p 
has full support, the punishment results in a performance loss, which prevents the manager from 
achieving vq. 



B. Illustrative Example (Type-2 Intervention) 

We consider a wireless network where two users interfere with each other. Each user has two 
pure actions, ul and uh, which represent low and high resource usage levels, respectively, and 
satisfy < < an- The service quality is determined randomly given an action profile, and 
there are two possible quality levels, y and y, with < y < y. The service quality is realized 
following the distribution 

p, if a = (ol, ol), 
piyW) = { q, if a = {aH,aL) or {ai^au), 
r, \fa = {aH,aH), 

where 0<r<g<]9<l. The intervention device in this example acts as a gatekeeper (i.e., 
type-2 intervention) after observing the service quality, having two pure actions: intervene (ao) 
and not intervene (So). When the intervention device does not intervene, a user receives a payoff 
given by the product of the quality level and its own usage level, i.e., Ui{do,a,y) = yai for all 
a and y, for i = 1, 2. When the intervention device does intervene, the service stops completely 
and a user receives zero payoff regardless of its usage level, i.e., Mj(ao, a,y) = for all a 
and y, for i = 1,2. The payoff of the manager is set as the average payoff of the users, i.e.. 
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Uo{ao,a,y) = [ui{ao,a,y) + U2{ao,a,y)]/2. Note that denoting the action of not intervening 
by So is consistent with Assumption [TJ A communication scenario that fits into this example is 
presented in Fig. [3l 

Since there are only two pure actions for the intervention device, we can represent = [0, 1] 
and use f{y) as the probability of not intervening given the signal y. The ex ante payoff function 
of user i is given by 

Vi{f,a) = [p{y\a)f{y)y+{l-p{y\a))f{y)y]ai. 

The payoff matrix of the game T j, i.e., the game when the intervention device does not intervene 
at all, is displayed in Table HI where we define yk = ky + (1 — k)y, for k = p,q, r. We assume 
that the game is the prisoner's dilemma game, i.e., Vqan > ypdi > VrO-H > Uqa-L and 
2?/pCiL > yq{ciH + cll)- Then without any intervention, it is the dominant strategy of each user to 
choose the high usage level, which results in the inefficient Nash equilibrium. The manager aims 
to improve the inefficiency of the Nash equilibrium by providing appropriate incentives through 
interventionO We restrict attention to symmetric action profile, assuming that the manager desires 
to sustain a symmetric action profile. 

Let wo(«) = supj{fo(/, a) : a G £{f)}- That is, wq{q) is the maximum payoff that the 
manager can obtain while sustaining a given action profile a. Since we focus on symmetric 
action profiles and there are only two pure actions for each user, let a E [0, 1] denote the 
probability of each user playing a^. Then we can show that Wo(0) = yrO-n and, for a E (0, 1], 

{i<]-r)+a[{p-q)-{q-r)]}aHaL jj 

[(l-r)aH-(i-q)aL]+a[ipaL-qaH)-iqaL-raH)] ^' 

Wo[a) = < if a{paL - qau) + (1 - «)(gaL - ran) > 0, (2) 

0, otherwise. 

The intervention rule that attains Wo(0) is given by / (i.e., no intervention), while the intervention 
rule that attains wq^o), for a E (0, 1], is given by 

{qaL - ran) + a[{paL - qau) - {qai - ran)] y 



f{y) = l and fiy) 



[(1 - r)aH - (1 - q)aL] + a[{paL - qa-u) - {qo-L - ran)] y 



'in this paper, we focus on the role of intervention schemes to improve the prospect of cooperation by applying intervention 
to prisoner's dilemma situations. Intervention schemes can also be used to help users achieve coordination by eliminating the 
multiplicity of Nash equilibria in coordination games such as the battle of the sexes and the stag hunt [11|. For example, in the 
stag-hunt game, an intervention scheme may induce players to choose the payoff dominant (but not risk dominant) "all stag" 
equilibrium by intervening in the hare hunt. 
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if aipai — qafj) + (1 — Oi){qaL — ran) > 0, and by f(lj) = f{y) = otherwise. We can think 
of a{paL — qdn) + (1 ~ OL){qciL — ^clh) as a measure of the sensitivity of signals between the 
two pure actions when the other user plays a. When signals are sufficiently sensitive at a, an 
intervention rule can sustain a with a positive payoff by degrading the low quality only. On the 
contrary, when signals are not sensitive, destroying all the payoffs is the only method to sustain 
a, which yields zero payoff for the users. Note that, when pai — qan < and qa^ — rau > 0, 
the pure action profile {aL.ai) cannot be sustained with a positive payoff while a completely 
mixed action profile can be. In this case, signals are more sensitive to the action of a user when 
the other user plays an- Hence, by inducing the users to play au with positive probability, 
the manager can make the signal a more informative indicator of a deviation. This allows the 
possibility that an intervention rule improves the performance of non-cooperative equilibrium by 
sustaining a completely mixed action profile even when the social optimum (0^,0^) cannot be 
sustained non-trivially. A similar discussion about the advantage of using mixed actions can be 
found in lfT4ll in the context of the repeated prisoner's dilemma game. 

In this example, we have vq = wo{0) = yrdH, vq = ^pflL, and Vq = maXag[o,i] wo(tt). We 
summarize the results about the performance with intervention, Vq, in the following proposition. 

Proposition 3: (i) Suppose that (a) pa^ — qau < and qa^ — ra^ < 0, or (b) pa^ — qa^ < 
qai — ran and {p — q){l — r) — {q — r)(l — g) < 0. Then Vq = vq. 

(ii) Suppose that (c) pai — qau > qai — ran > 0, (d) pai — qau > > qai — ran, or (e) 
< pcLL — qO'H < qo-L — fdH and (p — g)(l — r) — (g — r)(l — g) > 0. Then = max{vQ, wo{l)}. 

(iii) Suppose that (f) pa^ — qan < < ga^ — ran and (p — g)(l — r) — (g — r)(l — g) > 0. Then 
Vq = max{vo,wo{a)}, where 

_ qaL - ran 

{qai - ran) - {pai - qan) ' 

Proof: See Appendix lAl ■ 

Fig. in shows that each of the three cases of Vq = vq, Vq = wo{a), and Vq = wo{l) can arise 

depending on the parameter values. To obtain the results, we set ai = 1, an = 1-19, y = 5, 

y = 1, q = 0.8, and r = 0.65 while varying p = 0.9, 0.94, 0.96. We can see that, as p increases, 

the performance with intervention improves, getting closer to its upper bound Vq. In fact, when 
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wo{l), we have 



* _ (1 - p)aL{yqaH - ypdi) ^ n 
(1 - gja// - (1 -p)aL 



which is consistent with Proposition [2l The gap between Vq and Uq vanishes as p approaches 
1, while it increases with the deviation gain (yqan — ypdh)- This result is intuitive because 
punishment rarely occurs when p is close to 1 while a stronger punishment is needed as the 
deviation gain is larger. 

We can consider pricing schemes applied to this example, by having the manager charge 
different payments depending on the realized service quality. In order to find a pricing scheme 
that sustains a certain action profile, the manager needs to know how payments affect the payoffs 
of the users (i.e., the function Mi(ao, a, y), where oq is now interpreted as the charged payments). 
Suppose, for example, that the payoff of each user is given by its data rates. Since intervention 
influences data rates directly, it is relatively easy to find out how intervention actions affect 
payoffs. In contrast, finding out how payments affect payoffs requires the manager to know how 
the users value payments relative to data rates. This information is difficult to obtain since the 
users' valuations are subjective and thus not easily measurable. This discussion points out the 
informational advantage of intervention over pricing. 

IV. Performance with Intervention under Perfect Monitoring 
A. Analytical Results 

In this section, we consider the case where the intervention device can observe the pure action 
profile without errors (i.e., perfect monitoring), as stated formally in the following assumption. 

Assumption 2: Y = A, and only signal a can arise in the distribution p(a) for all a E A. 

With Assumption |2l we always have y = a, and thus we write the payoff functions more 
compactly as Uj(ao, a) instead of Ui{aQ, a, a), for all i E Aq. We also maintain the following two 
assumptions in this section. 

Assumption 3: There exists an action for the intervention device E Aq that satisfies, for all 

Ui{qQ, a) > Ui{aQ, a) for all ag E Aq, for all a E A. (3) 
Assumption 4: Aq is compact, and Ui : Aq x A ^ is continuous for all i E A/q. 
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Assumption |3] States that there exists an intervention action that is most preferred by the users 
and the manager regardless of the action profile of the users. We can interpret ag in Assumption[3] 
as the intervention action corresponding to no intervention, similarly to clq in Assumption[T] Then 
Assumption [3] implies that intervention can only reduce the payoffs of the users and the manager. 

In this section, we restrict attention to pure actions (both for the users and for the intervention 
device) while allowing the action spaces to be continuous spaces. Thus, an intervention rule is 
represented by a mapping f : A ^ Aq, while user i chooses a pure action E Ai given an 
intervention rule. Then the ex ante payoff function is given by Vi{f,a) = Ui{f{a),a), for all 
i E Aq. We define a class of intervention rules. 

Definition 3: : A — )■ is an extreme intervention rule with target action profile a E A if 
fa satisfies 

• fa{o) £ argminaoeAo ^j(«0) a) \i3 i E M such that Oj ^ fij and = Vj ^ i, and 

• fa{o) = Otherwise. 

By Assumption m argmina^g^Q Mj(ao, a) is non-empty for all a E A and i E M. Thus, for 
every a E A, there exists an extreme intervention rule with target action profile a. An extreme 
intervention rule prescribes an intervention action that minimizes the payoff of the deviator if 
there is a unilateral deviation from the target action profile while prescribing no intervention 
if there is no unilateral deviation. Hence, an extreme intervention rule provides the strongest 
incentive for the users to follow a given target action profile. Let £{J-') = Uf^jr£(f). That is, 
£{J-') is the set of all sustainable action profiles. 

Lemma 3: If a* E £{J^), then a* E £{fa*)- 

Proof: Suppose that a* E S{J^)- Then there exists an intervention rule / such that a*) > 
Vi{f, ai, a'i-) for all E Ai, for all i E M . Then we obtain Vi{fa*,a*) = Uj(ag,a*) > 
Ui{f{a*),a*) > Ui{f{ai,a*_i),ai,a*_i) > Ui{fa'{ai,a*_f),ai,a*_i) = Vi{fa*,ai,a*_f) for all ^ a*, 
for all i E J\f, where the first inequality follows from Q and the third from the definition of 
extreme intervention rules. ■ 

Let £* = {a E A : a E £{fa)}. The following results are the consequences of Lemma [3l 

Proposition 4: (i) £{J^) = S*. 
(ii) If (/*,a*) is an intervention equilibrium, then (fa*, a*) is also an intervention equilibrium. 

Proof: (i) Let a* E S*. Then a* E S{fa*) C S{T). Hence, £* C S{T). The other inclusion 
S{J^) C S* follows from Lemma [3l 
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(ii) Suppose that (/*, a*) is an intervention equilibrium. Then by Definition[2l a* E £{f*) and 
v^if^a*) > Vo{f,a) for all {f,a) e x A such that a e S{f). Since a* e S{T), a* e £{fa*) 
by Lemma |3l Hence, fo(/*,a*) > VQ{fa*,a*). On the other hand, since /a*(a*) = o-O' 
have vo{f*,a*) < vo{fa',a*) by ©. Therefore, vo{f*,a*) = vo{fa*,a*), and thus vo{fa',a*) > 
Vo{f, a) for all (f, a) E J-' x A such that a E £{/)■ This proves that {fa*, a*) is an intervention 
equilibrium. ■ 

Proposition |4] shows that it is without loss of generality to restrict attention to pairs of the form 
{fa, a) when we ask whether a given action profile is sustainable and whether there exists an 
intervention equilibrium. The basic idea is that, in order to sustain an action profile, it suffices 
to consider an intervention rule that punishes a deviator most severely. The role of extreme 
intervention rules is analogous to that of optimal penal codes ifTSl in repeated games with 
perfect monitoring. The following proposition characterizes intervention equilibria among pairs 
of the form (/a, a). 

Proposition 5: {fa*, a*) is an intervention equilibrium if and only if a* G £* and Uo{aQ, a*) > 
Mo(ag, a) for all a E £*. 

Proof: Suppose that {fa*, a*) is an intervention equilibrium. Then a* E £{fa*), and thus 
a* E £*. Also, Vo{fa*,a*) > Vo{f, a) for all (/, a) such that a E £{f). Choose any a E £* . Then 
a E £{fa), and thus Mo(ao,a*) = Vo{fa*,a*) > vo{fa,a) = Uo{aQ,a). 

Suppose that a* E £* and Uo{aQ,a*) > UQ{aQ,a) for all a E £*. To prove that {fa*, a*) is an 
intervention equilibrium, we need to show that (i) a* E £{fa*), and (ii) Vo{fa*,a*) > VQ{f, a) for 
all (/, a) such that a E £{f). (i) follows from a* E £*. To prove (ii), choose any (/, a) such that 
a E £{f). By Lemma [3l we have a E £*. Then vo{fa*,a*) = uo{aQ,a*) > Uo{a^,a) > vo{f,a), 
where the first inequality follows from a E £*. ■ 

Proposition [5] shows that the pair {fa, a) constitutes an intervention equilibrium if a solves 

maxMo(ao,a). (4) 

a££* 

The next proposition provides a sufficient condition under which an intervention equilibrium 
exists. 

Proposition 6: If Ai is a bounded set in Euclidean space for all i E Af, then there exists an 
intervention equilibrium. 
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Proof: By Proposition |4tii) and Proposition [51 an intervention equilibrium exists if and only 
if there exists a solution to the problem dH). Since uq^o^, a) is continuous in a, the result follows 
if we show that the constraint set S* is compact. Since £* C A and A is bounded, £* is also 
bounded. Let Gj(a) = argminaogAo Ui{aQ, a) for all a E A, for all i E M. By the Theorem of the 
Maximum [|T3l . Gi{a) is compact-valued and u.h.c. To show that 8* is closed, choose a sequence 
{a"} with a" — t- a* and a" G 8* for all n. Choose any i E M and G Ai. Let {oq } be a sequence 
such that Og G Gi{a'^,a\) for all n. Since a" G 8{fa^), we have Ui{aQ,a'^) > Mi(aQ, a-, a" •). 
Also, since (^^(a) is u.h.c, there exists a convergent subsequence of {ag} whose limit point 
is in Gi{a[,a*_j). Since is continuous, we obtain Ui{aQ,a*) > Uj(ao,a-,alJ by taking limits. 
This proves a* E 8{fa*) and thus a* E 8*. ■ 

Now we turn to the question of whether the best feasible performance, ^o, can be achieved with 
intervention. At an intervention equilibrium of the form (fa*, a*), intervention exists only as a 
threat to deter deviation, and no intervention is exerted as long as the users follow the target action 
profile. This contrasts with the imperfect monitoring scenario considered in Proposition [21 where 
providing incentives requires that intervention be used sometimes even when the users follow 
the target action profile, which results in a performance loss. Thus, with perfect monitoring, 
it is possible for an intervention scheme to achieve the best feasible performance as long as 
the intervention capability is sufficiently strong. This discussion is formally stated below as a 
corollary of Proposition [5l Note that vq = sup^g^ uo{(k)^ '^)' which is attained when A is compact. 

Corollary 1: If a° E arg max^gyi Mo(ao, a) and Ui{aQ,a") > Ui{fao{ai,a°_^),ai,a°_j) for all 
tti E Ai, for all i E M, then = vq. 

Extreme intervention rules are useful to characterize sustainable action profiles and intervention 
equilibria. However, they may not be desirable in practice. For example, when a user chooses an 
action different from the target action by mistake (i.e., trembling hands), an extreme intervention 
rule triggers the most severe punishment for the user, which may result in a large performance 
loss. Thus, it is of interest to investigate intervention rules that use weaker punishments than 
extreme intervention rules do. To obtain concrete results, we assume that Ai = [a^, a,] C M with 
ttj < ttj for all 2 G Ao in the remainder of this subsection. Below we define another class of 
intervention rules. 

Definition 4: fa^c : A — > is a (truncated) affine intervention rule with target action profile 
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a E A and intervention rate profile c G if 

/a,c(a) = [c ■ (a - S) + aol^ , 

where [x]'^ = mm{max{x, a}, /?}. 

The following proposition constructs an affine intervention rule to sustain an interior target 
action profile in the differentiable payoff case. 

Proposition 7: Let a* G ^ be an action profile such that a* G (aj,ai) for all i G M. Suppose 
that, for all i G A/", Ui is twice continuously differentiable and Mj(ao, a*) is strictly decreasing in 
ao on [ao,ao]. Let 

dui{aQ, a*) /dai 



I 



dui{qQ, a*)/dao 



for all i G MB Suppose that 



daj 



'oq, ai, a*_j) < for all G (Oj, a^) 



for alH G A/" such that c* = 0, 

d'^Ui 



^-2-vao, Oi, alj < for all G (a^, a*), 



(5) 



c*?^ + 2c*——^ + 



dai 



daidao da^ 



< 



(ao,ai,a_i)=(c*(ai-a*)+aQ,a,;,al-) 

for all ai G (a*, min{aj, a* + (oq — aQ)/c*}), and 



(ao, ai, alj) < for all a, G (a* + (ao — a^ 



for alH G TV such that c* > 0, and 



du 



-— (ao, aj, a*-) > for all a^ G (a^, a* + (ao - ao)/c* 
daj 



c*) + 2c* ^ ^ + 



dai 



daidan da^ 



< 



(ao,ai,a-i)={c*{ai-a*)+afj,ai,a*_-) 

for all ai G (max{ai, a* + (ao — ao)/c*}, a*), and 



^We define dui{a^,a*) / dao as the right partial derivative of Ui with respect to ao at (aq, 
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-^(oq, ai, a%) < for all G (a* , a^) 

for alH G A/" such that c* < Then /a* c* sustains a*. 

Proof: See the Appendix of [IJ. ■ 
Note that dui{aQ, a*)/daQ < for alH G A/" since Mj(ao, a*) is strictly decreasing in oq. Thus, 
c*, defined in Q, has the same sign as dui{aQ,a*)/dai. With = [ao,ao]j the intervention 
action can be interpreted as the intervention level, and at the target action profile a* the users 
receive higher payoffs as the intervention level is smaller. The affine intervention rule fa*,c*, 
constructed in Proposition Ul has the properties that the intervention device uses the minimum 
intervention level Oq when the users choose the target action profile a*, i.e., fa*,c*{ci*) = c^O' ^'^^ 
that the intervention level increases in the rate of |c,*| as user i deviates to the direction in which 
its payoff increases at (o-q, a*). The expression of c* in ([5]) has an intuitive explanation. Since c* 
is proportional to dui^a^, a*) / dai and inversely proportional to —dui{aQ, a*) / daQ, a user faces 
a higher intervention rate as its incentive to deviate from (ag, a*) is stronger and as a change in 
the intervention level has a smaller impact on its payoff. The intervention level does not react 
to the action of user i when c* = 0, because user i chooses a* in its self-interest even when the 
intervention level is fixed at Oq, provided that the other users choose a*_^. Finally, we note that 
if (/*, a*) is an intervention equilibrium and fa*,c sustains a* for some c, then {fa*,c, o.*) is also 
an intervention equilibrium, since /a*,c(«*) = Oq- 

B. Illustrative Example (Type-1 Intervention) 

As an illustrative example, we consider another resource sharing scenario in a wireless network 
where N > 2 users and an intervention device interfere with each other. In this example, the 
intervention device engages in type-1 intervention, affecting the service quality through its usage 
level. The actions of the users and the intervention device are their usage levels, and the action 
space is given by Ai = [0,aj] for all i G Aq. denotes the maximum usage level of user i, and 
ao denotes that of the intervention device, which can be considered as its intervention capability. 
We assume that > q/2b for all i G M, while imposing no restriction on qq- The service quality 



^We define (a, /3) = if a > /3. 
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is determined by the total usage level, ao + Y^ 



N 
i=l 



tti, following the relationship 



) 



+ 



Q{ao,a)= q-biao + y^^ai 



i=l 



where q,b > and = max{x, 0}. The payoff of user i E J\f is given by the product of the 
service quality and its own usage level, 



Ui{ao, a) is weakly decreasing in qq for all a, and thus we can consider an extreme intervention 
rule that takes the value Oq whenever a unilateral deviation occurs. 

In this example, we have vq = q'^/ANb, which is achieved when = and J2iLi '^i = ?/2&. 
The symmetric action profile that attains Uq is thus {ai, . . . ,ai), where ai = q/2Nb. On the other 
hand, the best performance at the non-cooperative equilibrium without intervention (i.e., when 
ao is held fixed at 0) is given by vq = q^ /{N + 1)^6, which is attained at (a/i, . . . , a^), where 
dh — q/{N + l)b. Note that ah > ai. Hence, the goal of the manager is to limit the usage levels 
of the users by using intervention as a threat. In the following proposition, we investigate the 
best performance with intervention, Vq, as we vary Hq. 

Proposition 8: (i) Vq = vq if and only if oq = 0. 

(ii) = vo if and only if oq > oJJ^*" = (ViV - lfql2Nb. 

(iii) fQ is strictly increasing with Hq on [0,aj{**"]. 

Proof: See Appendix El ■ 
Since Ui is weakly decreasing in oq, the set 8* is weakly expanding as the intervention 
capability ao is larger. This implies that the performance with intervention is weakly increasing 
with Hq. Proposition [8] shows that the performance with intervention improves as Hq increases, 
eventually reaching the best feasible performance when oq > aJJ^™. Thus, aJJ^™ can be interpreted 
as the minimum intervention capability for an intervention scheme to achieve the best feasible 
performance. We can show that oj,™" is increasing and concave in N . Fig. [5] plots the set 
8* = 8{F) as dark regions for the different values of with parameters N = 2, q = 12, 
6 = 1, and ai = 02 = 12. We can see that 8* expands as Qq increases. When = 0, 8* has 



Mi(ao, a) = Q{ao, a)ai. 



(6) 



The payoff of the manager is given by the average payoff of the users. 




i=l 
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only two elements, {ah, ct/i) = (4, 4) and (12, 12). When Hq = 0.1, there are more action profiles 
in £*. However, the symmetric social optimum {ai,ai) = (3,3) does not belong to £*, and 
Proposition [5] implies that the action profile (01,02) that minimizes ai + 02 among those in £* 
constitutes an intervention equilibrium. When Hq > (v^ — l)^g/46 ^ 0.51, the action profiles in 
£* that satisfy ai + 02 = 2a; = 6 constitute an intervention equilibrium, as all of them yield the 
best feasible performance vq. When oq > q/b = 12, the punishment from ciq is strong enough 
to make any action profile sustainable, i.e., £* = A. 

Applying Proposition Ul we can construct an affine intervention rule that sustains an action 
profile a* such that a* G (0,aj) for alH G A/" and J^iLi^^i < q/^, provided that the maximum 
intervention level ao is sufficiently large. With the payoff functions in the expression of c* 
in ([5]) is given by 

ba* a* 

for all i G M. For example, the affine intervention rule with target action profile (a/, . . . , a/) and 
the corresponding intervention rate profile c*{ai, . . . , ai) is expressed as 



N 



ao 

f{a) ' 



(iV-1) 5^a,- 



(7) 



Fig. [6] considers N = 2 and plots the payoff of user i against its action a^, provided that the 
manager chooses the intervention rule in (|7]) and the other user chooses a;. It also assumes that 
ao is sufficiently large. Without intervention, the best response of user i to ai is 3q/8b, which 
shows the instability of the symmetric social optimum (ai,ai). However, when the intervention 
rule ([7]) is used, the intervention device begins to intervene as user i increases its usage level 
from ai. An increase in payoff due to the increased usage level is more than offset by a decrease 
in payoff due to the quality degradation from intervention. As a result, users do not gain by a 
unilateral deviation from (a;, a;) under the intervention rule ©. 

V. Comparison with Existing Approaches 

The literature has studied various methods to improve non-cooperative outcomes. One such 
method is to use contractual agreements. Contract theory is a field of economics that studies how 
economic actors form contractual agreements, covering the topics of incentives, information, and 
institutions 11161 . Since intervention schemes aim to motivate users to take appropriate actions. 
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our work shares a theme as well as a formal framework with contract theory. However, most 
works in contract theory deal with the principal-agent problem using monetary payment as 
the incentive device (see, for example, [17]). In contrast, our work focuses on the problem of 
regulating selfish behavior in resource sharing by using intervention within the system as the 
incentive device. 

In game theory, correlated equilibrium is a solution concept that extends Nash equilibrium and 
thus has the potential to improve Nash equilibrium. A correlated equilibrium can be implemented 
by having a mediator who determines an action profile following a correlated distribution and 
makes a confidential recommendation to each player |fT8| . In an intervention game, the manager 
recommends a pure or mixed action profile to users but does not use a correlated distribution 
to determine the target action profile. Another difference is that an intervention scheme uses 
an external punishment device to prevent deviation, which is not present in the concept of 
correlated equilibrium. We also note that, for the prisoner's dilemma game where there is a 
dominant strategy for each player, the set of correlated equilibria coincides with that of Nash 
equilibria. This suggests that correlated equilibrium is more useful for inducing coordination (see, 
for example, [fT9|, which considers a multiple access network) than for achieving cooperation in 
a prisoner's dilemma scenario, as considered in this paper. 

Another method used in game theory to expand the set of Nash equilibria is repeated games. 
In a repeated game, players monitor their behavior and choose their actions based on past 
observations (see, for example, [5] and [|20ll for works that apply the idea of repeated games 
to wireless communications). Implementing an incentive scheme based on a repeated game 
strategy requires long-term relationship among interacting users, which may not exist especially 
in mobile, cognitive, and vehicular networks. Moreover, a repeated game strategy should be 
designed in accord with the self-interest of players in order to ensure that they execute monitoring 
and punishment or reward in a planned way. On the contrary, an intervention scheme uses an 
external device for monitoring and executing punishment. Hence, it can provide incentives for a 
dynamically changing population, and the manager can prescribe any feasible intervention rule 
according to his objective. 

In the communications literature, Stackelberg games have been used to improve Nash equilib- 
rium (see, for example, [6J and [21J). Stackelberg games divide users into two groups, a leader 
and followers, and the leader takes an action before the followers do. In intervention games, the 



21 



manager is the leader while users are followers, and the manager chooses an intervention rule, 
which is a contingent plan, instead of an action. Thus, intervention games are more suitable than 
Stackelberg games when the leader is not a resource user but a manager who regulates resource 
sharing by users. 

Pricing schemes or taxation can also be used to induce individuals to take socially desirable 
actions. Intervention affects the payoffs of users by directly influencing their resource usage, 
whereas pricing does so by using an outside instrument, money. Thus, intervention schemes 
can be implemented more robustly in that users cannot avoid intervention as long as they use 
resources. In order to achieve a desired outcome through an incentive scheme, the manager needs 
to know the impact of the incentive device on the payoffs of users. Since intervention affects 
the payoffs of users through physical quantities associated with resource usage (e.g., throughput, 
delay), which are easily measurable, this information is easier to obtain when the manager uses 
an intervention scheme than a pricing scheme, as discussed at the end of Section IIII-B[ 

Lastly, we discuss the difference between intervention and mechanism design in the sense of 
[ [22l Ch. 23]. In a mechanism design problem, the designer aims to obtain the private information 
of agents while he can control the social choice (e.g., a resource allocation). On the contrary, in 
an intervention game, the manager aims to motivate users to take appropriate actions while he 
has complete information about users (i.e., no private information). 

VI. Conclusion and Future Research 

In this paper, we have developed a game-theoretic framework for the design and analysis of 
intervention schemes, which are aimed to drive self-interested users towards a system objective. 
Our results suggest that the manager can construct an effective intervention scheme when he 
has an intervention device with an accurate monitoring technology and a strong intervention 
capability. We have illustrated our framework and results with simple resource sharing scenarios 
in wireless communications. However, the application of intervention schemes is not limited 
to the problems considered in this paper; our framework can be applied to a much broader 
set of problems in communications, including power control and flow control, as well as to 
various types of networks such as cognitive radio, vehicular networks, peer-to-peer networks, 
and crowdsourcing websites. Exploring the role of intervention in various specific scenarios is 
left for future research. Another direction of future research is to combine intervention with other 
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game-theoretic concepts. First, we can introduce intervention in repeated games, where users and 
the intervention device choose their actions depending on their past observations. We can also 
allow the intervention manager to use a correlated distribution, as in correlated equilibrium, when 
he determines the target action profile. Intervention can then be exerted when a user deviates 
from the recommended action. We can use the idea of mechanism design to deal with a scenario 
where the intervention manager has incomplete information about users. In such a scenario, the 
manager first obtains reports from users and then chooses an intervention rule depending on the 
reports. Finally, intervention can be used in the context of bargaining games, where the set of 
feasible payoffs in a bargaining game is obtained from sustainable action profiles. 

Appendix A 
Proof of Proposition [3] 

Proof: First, note that wo(0) > limQ,_^o+ wq{q). Suppose that pa/,— ga^ < and qaL—ran < 
0. Then wo{a) = for all a G (0, 1], and thus Vq = wo{0) = vq. This covers condition (a) 
in Proposition [3l Now suppose that at least one of the two inequalities puL — qan > and 
qai — ran > holds. We consider three cases. 
Case 1: pai — qan = Qcll — ran- 

In this case, a{paL — qan) + (1 — a){qaL — ran) > is satisfied for all a E (0, 1], and Wo{a) 
is increasing on (0, 1]. Thus, we obtain Vq = max{u7o(0), wo(l)}. 
Case 2: pai — qan > qai — ran- 
a{paL — qan) + (1 — (y){qaL — ran) > if and only if 

a > -(gflL - rag) 

~ {pai - qan) - {qaL - ran) ' 
where the right-hand side of ([8]) is smaller than 1. Also, pa l — qan > qo-L — ^clh implies 
p — q > q — r. We can show that the sign of the first derivative of wq at any a E (0, 1) is equal to 
that of (]9 — g)(l — r) — (g — r)(l — g), which is positive. Hence, we have Vq = max{wo{0),wo{l)}. 
Combining Cases 1 and 2 covers conditions (c) and (d). 
Case 3: pai — qan < qai — ran- 

ot{pciL — qan) + (1 — OL){qa'L — ran) > if and only if a < a. Also, the sign of the first 
derivative of wq is equal to that of (p — — t) — (g — r)(l — q). 
Case 3-1: < paL — qan < q^L — ran- 
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We have a > 1. Thus, Wq is increasing on (0, 1] if (p — g)(l — r) — {q — r)(l — g) > and 
non-increasing if {p — — r) — (g — r)(l — g) < 0. 
Case 3-2: pai — qa^ < < qa^ — rau. 

We have a <1. Thus, wq is increasing on (0, a] if {p — g)(l — t) — (g — r)(l — g) > and 
non-increasing if {p — q){l — r) — (q — r)(l — g) < 0. 

These results cover conditions (b), (e), and (f). ■ 

Appendix B 
Proof of Proposition [8] 

Proof: (Sketch) Note that Uq{0, a) depends on a only through J2iLi '^i- Wo(0, a) is increasing 
in J2iLi '^i — Xlili — Q'/2^, reaches the maximum at cti = g/26, is decreasing in 
^^]^ di for g/26 < Xlili — 0'/^' ^'^d remains at zero for J2iLi '^i ^ l/^- 

(i) If oo = 0, then Vq = vo by definition. To show the converse, suppose that ao > 0. Since the 
payoff function is continuous, we can show that (a/t — e, . . . , a/i — e) is sustainable for sufficiently 
small e > 0, which yields Vq > Vq. 

(ii) We have Vq = vq if and only if there exists a sustainable action profile a such that 
12iLi(^i = q/2b. Given J^iLi^i — Q'/2^, the incentive for user i to deviate is stronger as 
tti is smaller. Hence, it suffices to check whether the symmetric action profile (a;,..., a;) is 
sustainable. By Lemma [3l {ai, . . . ,ai) is sustainable if and only if 

max [g - 6(ao + (iV - l)ai + aj)]+aj < q^/ANb, 

aie[0,ai] 

which is equivalent to oq > (V^ — l)'^q/2Nb. 

(iii) Choose Hq, Oq G [0, a^^"] with < Oq. Let and (v^)' be the corresponding performances 
with intervention. Since < ao < o^*", there exists an action profile a that attains with 
intervention capability qq and satisfies g/26 < J^iLi'^i — Nq/(N + 1)6. We can show that 
(ai — e, . . . , Oat — e) can be sustained with for sufficiently small e > 0, which implies 

iv*y > V*. m 
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TABLE I 

Payoff matrix of the game r ; in the illustrative example in Section QiI-BI 





aL 


an 


aL 


VpaL, UpCiL 


UqUL, UqaH 


an 


UqaH, VqaL 


Vran, Vran 



Users choose their resource usage levels. 








< 


The service quality is determined based on 
the usage levels of users. 






< 


r 


The payoff of a user is determined 
depending on its own usage level and the 
service quality. 





Type-1 intervention: The 
intervention device affects the 
service quality through its own 
usage level. 



Type-2 intervention: The 
intervention device controls the 
service quality delivered to users. 



Fig. 1. Two types of intervention in a resource sharing scenario. 
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Payoff of 
user 2 



\Jncreasing direction of 
'system performance 



System 
performance 



Incentive 
Jjrovisiori ty 
interventioii 




Social optimum 
(Both users use the 
resource moderately.) 

X Intervention 
fequilibrium 



Non-cooperative equilibHum 
(Both users use the resource 
excessively.')-. 



> 

Payoff of 
user I 



y (best feasible 
performance) 

* (best performance 



V. 



with intervention) 



„ (best performance at 

non-cooperative equilibrium 
without intervention) 



Fig. 2. Performance improvement through an intervention scheme. (The system performance is given by the average payoff, 
and a dotted line represents the set of payoff profiles that yield the same system performance.) 



Transmitter 1 
(User 1) 



packets 
— > 



Transmitter 2 
(User 2) 



— > 

packets 



(Intervention 
device) 



-) Processor 



Gatekeeper 



No intervention 
Packets are 
delivered. 



1 

I 
I 

\/ 

Intervention: 
Packets are 
dropped. 



^ Receiver 



Usage level of a user: the number of packets it places to the queue per second. 

Service quality: the service rate, i.e., the ratio of the number of packets 
processed to the total placed. (The service rate is affected by the congestion level 
as well as random factors such as channel conditions.) 

Payoff of a user: its data rate, i.e., the number of its packets transmitted to the 
receiver per second. 



Fig. 3. A communication scenario that fits into the example in Section Ull-B I 
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(C) 

4. The graph of the function wo{a) defined in l2ll:[(a)]wo = vo (p ~ 0.9), |(b)| «o — wo{a) (p = 0.94), and[(c)]uo = wo 
0.96). (The dotted hnes display vo ~ ypiL-) 
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(c) ao = 5 

Fig. 5. Plot of £* as dark regions for the different values of ao in the example in Section HV-BI 
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Fig. 6. Plot of Ui against Ui wlien tiie manager ciiooses tiie affine intervention rule Q and the other user chooses a;. 



